Equivalence of Different Methods for Slant and Skew Corrections in Word Recognition Applications
نویسندگان
چکیده
ÐNormalization of slant and skew is often used in processing a word image before recognition. In this paper, we prove the theoretical equivalence of different methods for slant and skew corrections. In particular, we show that correcting first for skew by rotation and then for slant by a shear transformation in the horizontal direction is equivalent to first correcting for slant by a shear transformation in the horizontal direction and then for skew by a shear transformation in the vertical direction. Our proof can be easily modified to prove equivalence of other methods for correcting the slant and skew. Index TermsÐImage preprocessing, slant normalization, skew normalization, handwriting recognition. æ 1 INTRODUCTION IN most handwritten word applications, correcting the skew (deviation of the baseline from the horizontal directionÐFig. 1) and the slant (deviation of average near-vertical strokes from the vertical directionÐFig. 2) is an important preprocessing step. Typically, first the skew angle and then the slant angle are corrected [1], [3], [7]. In some cases, however, it might be more convenient to first correct the slant and then the skew [8]. In this paper, we prove the theoretical equivalence of these approaches showing that the order and the method in which the slant and skew angles are corrected is not important for word recognition applications. Specifically, we consider the following two methods of correcting both the slant and skew angles: 1) the first standard method corrects the skew angle by rotation and then corrects the slant angle by a shear transformation in the horizontal direction [1], [3] 2) the second method first corrects the slant angle by sheartransforming the entire image in the horizontal direction and then corrects the skew angle by shear-transforming the entire image in the vertical direction [8]. Assuming that the pixel coordinates are real numbers, we will prove that the two images produced 1) by rotation followed by a shear transformation in x-direction and 2) by a shear transformation in x-direction followed by a shear transformation in y-direction differ only in their aspect ratios. And if reasonable limits are imposed on the possible values of skew and slant angles, the changes in the aspect ratio are negligible. For example, if the skew angle is 10 and the slant angle is 30, the two aspect ratios would differ by only a multiplicative factor of 0:85. Rounding of coordinates will ultimately produce slightly different images. This, however, should not influence a word recognizer in any significant way. High-level features [1], [2] (loops, ascenders, descenders, etc.) will remain unchanged while low-level features [4], [5], [6], [9] (various moments, distribution of slopes, or pixels in subregions of each segment, etc.) will change only slightly. 2 MATHEMATICAL FORMULATION To justify the above claims, let us consider a general image of a handwritten word where the deviation of the baseline from the horizontal axis is and the deviation of the average direction of near-vertical strokes from the line perpendicular to the baseline is Ðsee the top image in Fig. 3. Assume that the baseline intersects the origin of the coordinate system. This assumption will only simplify our formulas and has certainly no effect on the generality of our results. 2.1 First Method Images in Fig. 3 show the individual steps of the first (traditional) method of correcting first the skew angle by rotation and then the slant angle by shear transformation in the horizontal direction. Given a point x; y, its new coordinates x0; y0 after rotating the entire image around the origin by angle ÿ (negative ) satisfy x0 y0 cos sin ÿ sin cos x y : After shear-transforming the point in the horizontal direction by angle ÿ (negative ), its new coordinates x00; y00 will be x00 y00 1 ÿ tan 0 1 x0 y0 T x y ;
منابع مشابه
Offline English Handwritten Word Recognizer Using Best Feature Extraction
Over several years lot of research has been done in handwritten word recognition but yet need for higher recognition rate still exist. In this paper we have proposed a method that performs preprocessing steps like skew and slant correction. It uses best structural features for feature extraction .Euclidean distance method is applied for classification that produces single matching word having m...
متن کاملSkew and Slant Corrections for IR Images in Indian License Plate Recognition Using HCD & HPR
Character segmentation and detection play a very important role in the license plate recognition (LPR) system. For accurate character segmentation first need is to remove skew from the received license plate for the night vision images. Skew is the angle by which the image seems to be deviated from its perceived steady state position. To solve the problem, a combined approach for skew correctio...
متن کاملA Novel Image Matching Approach for Word Spotting
A Novel Image Matching Approach for Word Spotting Muhammad Ismail Shah Word spotting has been adopted and used by various researchers as a complementary technique to Optical Character Recognition for document analysis and retrieval. The various applications of word spotting include document indexing, image retrieval and information filtering. The important factors in word spotting techniques ar...
متن کاملAn Integrated System for Handwritten Document Image Processing
In this paper we attempt to face common problems of handwritten documents such as nonparallel text lines in a page, hill and dale writing, slanted and connected characters. Towards this end an integrated system for document image preprocessing is presented. This system consists of the following modules: skew angle estimation and correction, line and word segmentation, slope and slant correction...
متن کاملMultiple classifier systems in offline cursive handwriting recognition
The thesis addresses the application of multiple classifier systems (MCS) methods in the domain of handwriting recognition. To evaluate the MCS methods two different state-of-the-art handwritten word recognizers are used. Both recognizers are based on Hidden Markov Models (HMMs) and process handwritten words in the same manner: First the word image is normalized in respect to slant, skew and he...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- IEEE Trans. Pattern Anal. Mach. Intell.
دوره 23 شماره
صفحات -
تاریخ انتشار 2001